Visual Reasoning


ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models

Add code
May 19, 2025
Viaarxiv icon

Advancing Generalization Across a Variety of Abstract Visual Reasoning Tasks

Add code
May 19, 2025
Viaarxiv icon

Neurosymbolic Diffusion Models

Add code
May 19, 2025
Viaarxiv icon

ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models

Add code
May 19, 2025
Viaarxiv icon

Reasoning-OCR: Can Large Multimodal Models Solve Complex Logical Reasoning Problems from OCR Cues?

Add code
May 19, 2025
Viaarxiv icon

FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning

Add code
May 19, 2025
Viaarxiv icon

TCC-Bench: Benchmarking the Traditional Chinese Culture Understanding Capabilities of MLLMs

Add code
May 19, 2025
Viaarxiv icon

Rethinking Predictive Modeling for LLM Routing: When Simple kNN Beats Complex Learned Routers

Add code
May 19, 2025
Viaarxiv icon

G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning

Add code
May 19, 2025
Viaarxiv icon

SSR: Enhancing Depth Perception in Vision-Language Models via Rationale-Guided Spatial Reasoning

Add code
May 18, 2025
Viaarxiv icon